Selective Inference for Hierarchical Clustering
نویسندگان
چکیده
Classical tests for a difference in means control the Type I error rate when groups are defined priori. However, instead via clustering, then applying classical test yields an extremely inflated rate. Notably, this problem persists even if two separate and independent datasets used to define their means. To address problem, article, we propose selective inference approach between clusters. Our procedure controls by accounting fact that choice of null hypothesis was made based on data. We describe how efficiently compute exact p-values clusters obtained using agglomerative hierarchical clustering with many commonly linkages. apply our method simulated data single-cell RNA-sequencing Supplementary materials article available online.
منابع مشابه
HIERARCHICAL DATA CLUSTERING MODEL FOR ANALYZING PASSENGERS’ TRIP IN HIGHWAYS
One of the most important issues in urban planning is developing sustainable public transportation. The basic condition for this purpose is analyzing current condition especially based on data. Data mining is a set of new techniques that are beyond statistical data analyzing. Clustering techniques is a subset of it that one of it’s techniques used for analyzing passengers’ trip. The result of...
متن کاملAutomatic Ontology Merging by Hierarchical Clustering and Inference Mechanisms
One of the core challenges for current landscape of ontology based research is to develop efficient ontology merging algorithms which can resolve the mismatches with no or minimum human intervention, and generate automatic global merged ontology on-the-fly to fulfil the needs of automated enterprise business applications and mediation based data warehousing. This paper presents our approach of ...
متن کاملInference of a Phylogenetic Tree: Hierarchical Clustering versus Genetic Algorithm
This paper compares the implementations and performance of two computational methods, hierarchical clustering and a genetic algorithm, for inference of phylogenetic trees in the context of the artificial organism Caminalcules. Although these techniques have a superficial similarity, in that they both use agglomeration as their construction method, their origin and approaches are antithetical. F...
متن کاملhierarchical data clustering model for analyzing passengers’ trip in highways
one of the most important issues in urban planning is developing sustainable public transportation. the basic condition for this purpose is analyzing current condition especially based on data. data mining is a set of new techniques that are beyond statistical data analyzing. clustering techniques is a subset of it that one of it’s techniques used for analyzing passengers’ trip. the result of t...
متن کاملApproximating Hierarchical MV-sets for Hierarchical Clustering
The goal of hierarchical clustering is to construct a cluster tree, which can be viewed as the modal structure of a density. For this purpose, we use a convex optimization program that can efficiently estimate a family of hierarchical dense sets in high-dimensional distributions. We further extend existing graph-based methods to approximate the cluster tree of a distribution. By avoiding direct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Statistical Association
سال: 2022
ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']
DOI: https://doi.org/10.1080/01621459.2022.2116331